Proteochemometric modelling coupled to in silico target prediction: an integrated approach for the simultaneous prediction of polypharmacology and binding affinity/potency of small molecules
Abstract
The rampant increase of public bioactivity databases has fostered the development of computational chemogenomics methodologies to evaluate potential ligand-target interactions (polypharmacology) both in a qualitative and quantitative way. Bayesian target prediction algorithms predict the probability of an interaction between a compound and a panel of targets, thus assessing compound polypharmacology qualitatively, whereas structure-activity relationship techniques are able to provide quantitative bioactivity predictions. We propose an integrated drug discovery pipeline combining in silico target prediction and proteochemometric modelling (PCM) for the respective prediction of compound polypharmacology and potency/affinity. The proposed pipeline was evaluated on the retrospective discovery of Plasmodium falciparum DHFR inhibitors. The qualitative in silico target prediction model comprised 553,084 ligand-target associations (a total of 262,174 compounds), covering 3,481 protein targets and used protein domain annotations to extrapolate predictions across species. The prediction of bioactivities for plasmodial DHFR led to a recall value of 79% and a precision of 100%, where the latter high value arises from the structural similarity of plasmodial DHFR inhibitors and T. gondii DHFR inhibitors in the training set. Quantitative PCM models were then trained on a dataset comprising 20 eukaryotic, protozoan and bacterial DHFR sequences, and 1,505 distinct compounds (in total 3,099 data points). The most predictive PCM model exhibited $R^2_{0,\mathrm{test}}$ and $\mathrm{RMSE}_{\mathrm{test}}$ values of 0.79 and 0.59 pIC50 units respectively, which was shown to outperform models based exclusively on compound ($R^2_{0,\mathrm{test}}$/$\mathrm{RMSE}_{\mathrm{test}}$ = 0.63/0.78) and target information ($R^2_{0,\mathrm{test}}$/$\mathrm{RMSE}_{\mathrm{test}}$ = 0.09/1.22), as well as inductive transfer knowledge between targets, with respective $R^2_{0,\mathrm{test}}$ and $\mathrm{RMSE}_{\mathrm{test}}$ values of 0.76 and 0.63 pIC50 units.
Finally, both methods were integrated to predict the protein targets and the potency on plasmodial DHFR for the GSK TCAMS dataset, which comprises 13,533 compounds displaying strong anti-malarial activity. Of those compounds, 534 were identified as DHFR inhibitors by the target prediction algorithm, while the PCM algorithm identified 25 compounds, and 23 compounds (predicted pIC50 > 7) were identified by both methods. Overall, this integrated approach simultaneously provides target and potency/affinity predictions for small molecules.
Graphical abstract: Proteochemometric modelling coupled to in silico target prediction.
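The integration step described in the abstract, where compounds flagged by the qualitative target prediction model are intersected with compounds that the PCM model predicts to be potent, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Prediction` record, the `select_hits` function, the compound identifiers and the toy values are all hypothetical, and only the pIC50 > 7 cutoff is taken from the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    """Hypothetical per-compound record combining both model outputs."""
    compound_id: str
    predicted_dhfr_target: bool  # qualitative call from the target prediction model
    predicted_pic50: float       # quantitative potency estimate from the PCM model


def select_hits(predictions, pic50_cutoff=7.0):
    """Return (target-prediction hits, PCM hits, hits found by both methods)."""
    target_hits = {p.compound_id for p in predictions if p.predicted_dhfr_target}
    potency_hits = {p.compound_id for p in predictions
                    if p.predicted_pic50 > pic50_cutoff}
    return target_hits, potency_hits, target_hits & potency_hits


# Toy data invented for illustration only; not values from the study.
toy_predictions = [
    Prediction("cpd-1", True, 7.8),
    Prediction("cpd-2", True, 5.1),
    Prediction("cpd-3", False, 7.4),
    Prediction("cpd-4", False, 4.9),
]

target_hits, potency_hits, both = select_hits(toy_predictions)
print(sorted(target_hits), sorted(potency_hits), sorted(both))
```

Keeping the two hit sets separate before intersecting them mirrors how the abstract reports three counts (target prediction, PCM, and both), so compounds flagged by only one of the two methods remain available for inspection.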